GAUSSIAN MARGINALS OF CONVEX BODIES WITH SYMMETRIES 



MARK W. MECKES 

Abstract. We prove Gaussian approximation theorems for specific k- dimensional marginals of 
convex bodies which possess certain symmetries. In particular, we treat bodies which possess a 
1-unconditional basis, as well as simplices. Our results extend recent results for 1-dimensional 
marginals due to E. Meckes and the author. 



1. Introduction 

Let K be a convex body in the Euclidean space W 1 , n > 2, equipped with its standard inner prod- 
uct (-, ■) and Euclidean norm | • |, and let /x denote the uniform (normalized Lebesgue) probability 
measure on K. In this paper we consider /c-dimensional marginals of fi, that is, the push-forward 
(i o Pg 1 of /i by the orthogonal projection Pg onto some /c-dimensional subspace E C W 1 . 

The question of whether every convex body K C R n has 1-dimensional marginals which are close 
to Gaussian measures when n is large is known as the central limit problem for convex bodies, and 
was apparently first explicitly posed in the literature in [H [Bj. A natural extension is to ask, for 
how large k < n does K necessarily possess nearly Gaussian /c-dimensional marginals? The latter 
question can be thought of as asking for a measure-theoretic analogue of Dvoretzky's theorem, 
which implies the existence of nearly ellipsoidal /c-dimensional projections of K when k <C logra. 

Very recently Klartag [121 [13] showed that any convex body has nearly Gaussian /c-dimensional 
marginals when k < cn a , where c > and < a < 1 are some universal constants; closeness of 
probability measures is quantified by the total variation metric and also has a power-law dependence 
on n. This points out an important difference from Dvoretzky's theorem, in which it is known that 
for an arbitrary convex body k can only be taken to be logarithmically large in n. Klartag's work 
followed partial results, involving different additional hypotheses and metrics between probability 
measures, by many authors; we mention [U fl9j l22l [141 [9] among recent contributions and refer to 
[12j for further references. 

In much of the work on this problem, including the main results of \12\ 113] . the existence of 
nearly Gaussian marginals [i o P^ 1 is proved nonconstructively, so that no concrete such subspace 
E is exhibited. This is typical of the proofs of Dvoretzky-like results. In [18], E. Meckes and 
the author used Stein's method of exchangeable pairs to prove Berry-Esseen theorems for specific 
1-dimensional marginals of convex bodies which possess certain types of symmetries. Roughly, 
under some additional hypotheses, [18] shows that a 1-dimensional marginal \i o P^ 1 is nearly 
Gaussian when K possesses many symmetries a : W 1 — > W 1 for which the 1-dimensional subspace 
o~(E) C W 1 is very different from E. In [12\ I14j. another approach is used to study the marginal of 
a 1-unconditional body on the subspace spanned by (1, . . . , 1); see the remarks at the end of this 
paper for the relationship between these approaches. 

The main purpose of this paper is to prove versions of the results of [18] for /c-dimensional 
marginals with k > 2, using a new multivariate version of Stein's method of exchangeable pairs 
due to S. Chatterjee and E. Meckes [7]. Our results show that, in contrast to the situation for 
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Dvoretzky's theorem, in the measure-theoretic setting one can identify specific well-behaved high- 
dimensional projections for large classes of convex bodies. We consider bodies K which are 1- 
unconditional, or which possess all the symmetries of a centered regular simplex. Another purpose 
of this paper is to point out how some of the methods used here improve quantitatively some of 
the results of [18]. In [TSj a symmetry hypothesis was introduced which simultaneously generalizes 
1-unconditionality and the symmetries of a regular simplex, described in terms of a normalized 
tight frame of vectors. For the sake of transparency we have preferred to treat these special cases 
independently here, although that more general setting could also be treated with the methods of 
this paper. 

Many of the results in this area treat marginals of probability measures \x more general than 
uniform measures on convex bodies; in particular the methods of [18] apply to completely arbitrary 
probability measures which satisfy the symmetry hypotheses. One common generalization, treated 
in |12|, I13j for example, is to log-concave measures, i.e., measures with a logarithmically concave 
density with respect to Lebesgue measure. This is a natural setting since marginals of log-concave 
measures are again log-concave. While some of the methods of this paper apply to general prob- 
ability measures, we have chosen to restrict to the log-concave case, in which stronger results are 
possible. 

The arguments in this paper are a synthesis of the methods of the papers P2J El Q21 [H]. The 
proofs of the main results generalize the arguments of [18] in order to apply an abstract normal 
approximation result in [JJ. In order to derive stronger results for log-concave measures, we apply 
a concentration result from [14] and adapt a smoothing argument from [121 [9] . 

The rest of this paper is organized as follows. After defining some notation and terminology, in 
Section [2] we state and discuss our main results. Section [3] presents and develops our tools. Finally, 
in Section U] we prove our main results and make some final remarks about our methods. 

Notation and terminology. It will be convenient to frame our results in terms of random vec- 
tors rather than probability measures. We use P and E to stand for probability and expectation 
respectively, and denote by E[Y| A] the conditional expectation of Y given the value of X. 

Throughout this paper X = (X\ , . . . , X n ) will be a random vector in W 1 , n > 2. A random vector 
is 1 -unconditional if its distribution is invariant under reflections in the coordinate hyperplanes of 
R n . By Z we denote a standard Gaussian random vector in R n with density 



with respect to Lebesgue measure, or a standard Gaussian random variable in R; the usage should 
be clear from context. A random vector X is called isotropic if it has mean and identity covariance: 



Observe that if X is isotropic then E|A| 2 = n. Isotropicity is a natural assumption in this setting 
since it is preserved by orthogonal projections and Z is isotropic; see |19j however for recent work 
demonstrating that a nonisotropic affine image of X is more useful in some contexts. 

The total variation metric on the distributions of random vectors in M. n may be defined by the 
two equivalent expressions: 




EX = 



EX i X j = 6 ij . 



d TV {X,Y) = 2sup{|P[A G A) - P[y e A)\ : A C R n measurable} 
= sup{|E/(A)-E/(F)| : /€C c (n ||/|U < l}. 
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The normalization stated here is the conventional one in analysis and differs by a factor of 2 from 
a normalization used frequently in probability texts. Note that dTv{X,Y) = \\f — g\\i if X and 
Y possess densities / and g respectively. The (Li-)Wasserstein metric is defined by requiring test 
functions to be Lipschitz instead of bounded: 

d 1 (X,Y)= S up{\Ef(X)-Ef(Y)\ :Vx,yeR n , \f(x)-f(y)\<\x-y\}. 

Note that d\ metrizes a weaker topology on probability measures than dxvi but quantitative results 
for these two metrics are not directly comparable. In particular, dxv{X,Y) < 2 always, but the 
typical order of magnitude of d±(X,Y) is y/n. 

For x G M n , \\x\\ p = (Ya=i Nir") if 1 < p < oo. Except where noted, symbols c, C, etc. denote 
universal constants, independent of n, k, and the distribution of X, which may differ in value from 
one appearance to another. 



2. Statements of the main results 

Let 6{ = (Oj, . . . , Of), 1 < i < k, be a fixed collection of k orthonormal vectors. Given an isotropic 
random vector X G M. n , define 

(2.i) w l = (x,e i ). 

Then W = iW\, . . . , Wfc) G M fc is isotropic, and is essentially the same as Pe(X), where E is spanned 
by #i, ... , More concretely, W = T(Pe{X)), where T : W 1 — ► R fc is the partial isometry given 
by the matrix whose ith. row is 6% . Theorems 12.11 and 12.21 give bounds on the Wasserstein and total 
variation distance of W from a standard Gaussian random vector Zgl*. 

Theorem 2.1. Let X G K n be 1-unconditional, log-concave, and isotropic, and let W G M fc be as 
defined in (|2.ip . Then 



di(W,Z) < 14 

and 



i 



9-H 2 



i=i 

fe \ 1/3 



drv(W^ Z) < Ck 5 ^ f ll^illi^ 
^ i=l ' 



Before stating our other main results we will makes some remarks to put the bounds in Theorem 
12.11 in perspective. To begin, assume for the moment that 

(2.2) \el\= n ~ l/2 

Theorem 12.11 then shows 

(2.3) dl{W ,Z)<U-^- 
and 

k 7/6 



(2.4) d TV (W,Z)<C- 

In particular, d\(W, Z) <C 1 as soon as k < n 1 / 4 and d TV (W, Z) <C 1 as soon as k <C n ' . 

If n is a power of 2 then (|2.2p will be satisfied if {\/n9i : 1 < i < k} are the first k vectors in 
a Walsh basis for R n . For arbitrary n it is not necessarily possible to satisfy (|2.2p for 0±, . . . , Of. 
orthogonal. However, up to the values of constants, (|2.3p and (|2.4p will be satisfied by letting 
{y/mOi : 1 < i < k} be the first k vectors in a Walsh basis for M. m C R n , where m is the largest 
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power of 2 not exceeding n (so that m > n/2). In fact, a result of de Launey [8] shows that one 
can also obtain such a so-called partial Hadamard basis (0j) when m is the largest multiple of 4 
not exceeding n, as long as k < cm for some absolute constant < c < 1. Observe that this latter 
condition is necessary anyway for the bounds in (|2,3p and (|2.4p to be nontrivial. 

Moreover, at the expense of the value of the constants which appear, (|2,3p and (|2.4p hold for 
any orthonormal basis Q\,...,6k of most /c-dimensional subspaces E C IR n . This statement can 
be made precise using a concentration inequality on the Grassmann manifold G n ^ due to Gordon 
[TT] , cf. [181 Lemma 16], although we do not do so here. 

The error bounds in Theorem l2.1l depend on a recent optimal concentration result for 1-unconditional 
log-concave random vectors due to Klartag [2] , given as Proposition 14.11 below. Klartag used a 
more general version of that result to give a sharp estimate on Gaussian approximation with re- 
spect to Kolmogorov distance (maximum difference between distribution functions) in the setting 
of Theorem 12.11 when k = 1; the typical error is of the order 1/n. Using a smoothing lemma from 



A Wasserstein distance estimate as in Theorem 12 . 1 1 can be proved without the assumption of log- 
concavity, at the expense of explicitly involving Var(|X| 2 ) in the bound, and (for technical reasons) 
making some stronger symmetry assumption on the distribution of X. The smoothing arguments 
involved in proving the total variation estimate, however, depend more crucially on log-concavity. 

We now proceed to our other main results. 

Theorem 2.2. Let X £ M. n be uniformly distributed in a regular simplex 

(2.5) A n = yjn(n + 2) conv{t>i, . . . ,v n }, 

where \v{\ = 1 for 1 < i < n + 1, and let W E M fc be as defined in (I2.ip . Then 



Theorem 12.21 shows that W is approximately normal as long as the vertices of A n are not close to 
the subspace E = span(#i, . . . , 6^). By the remarks following Theorem 12. II above and |18t Corollary 
6], Theorem 12.21 shows that for a typical subspace E, dTv(W,Z) < c^Jj^. The same proof as for 
Theorem 12.21 yields similar results for random vectors with other distributions invariant under the 
symmetry group of a regular simplex. 

Our last main result improves the typical dependence on n of the total variation bound of 
Theorem 12.21 in the case that k = 1 . 



Theorem 2.3. Let X be uniformly distributed in a regular simplex A n as defined in (|2.5p . let 
6 £ S™- 1 be fixed, and let W = {X, 6). Then 



[5] this implies a total variation estimate (which may not be sharp) of the order 





and 




1/3 



n+1 




where Z is a standard Gaussian random variable. 
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For a typical 6 6 S"™ -1 , we obtain here ££tv(Wj Z) < cn~ 1//4 . This also improves an error bound 
given in |18| ; see the remarks at the end of this paper for further details. 



3. Smoothing and abstract Gaussian approximation theorems 
For n > 1 and t > 0, define <p t : R n -> E by 

(27Tt2)n/2 e 

so is the density of tZ, where Z S M n is a standard Gaussian random vector. A well-known 
consequence of the Prekopa-Leindler inequality [16\ [21] (or see |10| ) is that the convolution of 
integrable log-concave functions is log-concave; hence in particular / * ipt is log-concave for any 
log-concave probability density / : W 1 — ► R + . Furthermore it is well-known that 

||/*¥*-/||i-0 asi^O 

for any integrable /. Thus log-concave random variables are arbitrarily well approximated, in the 
total variation metric, by log-concave random vectors with smooth densities. The statements of 
Section [2] involving the total variation metric rely on a quantitative version of this observation. We 
say that / : M. n — > R + is isotropic if it is the density of an isotropic random vector in R n . The 
following is a sharp version of Lemma 5.1 in |12j . 



Proposition 3.1. Let f : M. n — ► R+ an isotropic and log-concave probability density. 

(1) Ifn = 1 then 

11/*^ -/Hi < 2^2 t 

/or a// 1 > 0. 

(2) (Klartag-Eldan) Ifn>2 then 

\\f*<Pt ~ /111 < cnt 
for all t > 0, where c > is an absolute constant. 

Proposition 13.1 t |2j) was conjectured in an earlier version of this paper which also proved a weaker 
version of that estimate by optimizing over some of the parameters in the proof of |12l Lemma 5.1]. 
After that version of this paper was posted on arxiv.org, Klartag proved the conjecture; the proof 
appears in [9l Section 5]. 

For arbitrary /, Proposition 13.11 is sharp up to the values of the constants 2\/2 and c. For the 
particular case / = cp\, one can show 



(3.1) \\<p! *(p t - yi||i = Hyyi^ - villi < v2n t 

for all £ > and any n (cf. [12\ Lemma 4.9], or the proof of Proposition I3.1l fl]) below). 

Proof of Proposition \3. . First, we can assume that / is smooth and everywhere positive, for 
example by convolving / with ip £ , rescaling for isotropicity, and letting e — > 0. A special case of a 
result of Ledoux |15[ formula (5.5)] about the heat semigroup on a Riemannian manifold implies 
that 



(3.2) 



||/*Vt-/||i< V2i H/'lli 
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for any t > 0. Since / is log-concave it is unimodal, i.e., there exists an a € R such that f'(s) > 
for s < a and f'(s) < for s > a, and since / is also isotropic we have f(a) < 1 (see e.g. [XT} 
Lemma 5.5(a)]). Therefore 

/a poo 
f'(s) ds - / f'(s) ds = 2/(o) < 2, 
-oo J a 

which proves the first claim. □ 

The following abstract Gaussian approximation theorem was proved by Stein in [23] ; the version 
stated here incorporates a slight improvement in the constants proved in [2]. Recall that a pair of 
random variables (W, W') is called exchangeable if the joint distribution of (W, W') is the same as 
the distribution of (W, W). 

Proposition 3.2 (Stein). Suppose that (W, W) is an exchangeable pair of random variables such 
that EW = 0, EW 2 = 1, and E[W - W\W] = -XW. Then 

\Eg(W) - Eg(Z)\ < ^Var E[{W - W) 2 \W] + U^E|W" - W\ 3 

A 4A 

for any g G C^°(R) ; where ZgR denotes a standard Gaussian random variable. 

Stein used a smoothing argument to derive a version of Proposition 13.21 for the Kolmogorov 
distance, which was the main tool in the proofs of most of the results of [18J . Estimates for total 
variation distance for log-concave distributions were obtained in [18] by combining the Kolmogorov 
distance estimates with [51 Theorem 3.3], which entails an additional loss in the error bound. Here 
we use Proposition I3,1I |T]) to obtain a version of Proposition 13.21 for total variation distance and 
log-concave distributions, which matches Stein's bound for Kolmogorov distance used in [18] ; this 
is the main technical tool in the proof of Theorem 12.31 

Corollary 3.3. Suppose, in addition to the hypotheses of Proposition HOI that W is log-concave. 
Then 

d TV (W,Z) < j^YaiE[(W - W) 2 \W] + 2^jE\W - W\ 3 . 

It follows from the proof of Proposition l3.1l fT1) that Corollary 13.31 only requires W to have a 
bounded unimodal density with respect to Lebesgue measure; the coefficient 2 in the r.h.s. should 
be replaced by a constant depending on the maximum value of the density. 

Proof. Let g G C C (R) with ||g||oo < 1, an d let / denote the density of W. Assume for now that / 
is smooth. Given t > 0, define h = g * ipt. To begin, observe that 

IHloo < |M|oo[|<Pt[|l < 1, 

Halloo = \\g* 0/4) Hoc < lblloc||</4lli < 7\/~ 

t V vr 

Proposition 13.21 applied to h implies that 

(3.3) \Eh(W) - Eh(Z)\ < \^JV&rE[(W' - W) 2 \W} + -^L — E\W' - W\ 3 . 

A 2\/2vr Xt 



Next, by Proposition [3JJHJ , 

= J \g*<Pt(s) -g(s)]f(s) ds 
< ||3||oo||/*^-/||i<2v / 2i. 



\Eh(W)-Eg(W)\ 



[f*M s ) - f( s )Ms) ds 
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Similarly, by d£2J 

(3.5) \Eh(Z)-Eg(Z)\<\\g\\ 0O \\<p 1 *<p t -<p 1 \\ 1 <y/2y i \\it 

\/7T 



The only reason for using (|3.2p directly instead of applying Proposition l3.1l |Tj) here is to obtain a 
slightly better constant. Combining (|3.3p . (|3.4p . and (|3.5p yields 

(3.6) \Eg(W) - Eg(Z)\ < \y/VwK[(W' - W) 2 \W] + — ^ — E|W" - W| 3 + 2 ( V2 + -L ] t. 

A 2V27T At V v 71 " / 

The corollary, under the assumption that / is smooth, now follows by optimizing in t. The coefficient 
of 2 given in the second term in the statement of the corollary is not optimal and is given as such 
for simplicity. 

The corollary can be reduced to the smooth case with a convolution argument as for Proposition 
13-H although it is slightly more complicated because it is necessary to smooth not only / but 
the exchangeable pair (W,W). To do this, let Zi,Z2 be standard Gaussian random variables 
independent of each other and of (W, W), set Z = Z\, and set 



Z' = (l- X)Z X + yj2\ - A 2 Z 2 . 
Then (Z, Z') is an exchangeable pair and E[Z' — Z\Z] = —XZ. Now for e > let 

W e = -= L=(W + eZ), W ' e = -=L={W' + eZ'). 
yl+e v 1 + e 

Then (W £ ,W' e ) is an exchangeable pair that satisfies all the hypotheses of the corollary (log- 
concavity follows from the Prekopa-Leindler inequality), and W e has a smooth density. Applying 
the corollary to (W £ , W' £ ) and letting e — > yields the general case. □ 



The main technical tool in the proofs of Theorems 12.11 and 12.21 is the following multivariate 
version of Proposition 13.21 recently proved by S. Chatterjee and E. Meckes in [7]. For a smooth 
function / : M fc — ► R, we denote by M\(f) = |||V/|||oo the Lipschitz constant of / and 



M 2 (f) = ||||V 2 / 



2 , ,., 

oo 



the maximum value of the operator norm of the Hessian of /, or equivalently the Lipschitz constant 
of V/ : R k -> R fc . 

Proposition 3.4 (Chatterjee and E. Meckes). Let W and W' be identically distributed random 
vectors in M k defined on a common probability space. Suppose that for some constant A > and 
random variables Eij, 1 < i,j < k, 

E[W(-Wi\W] = -XW h 
E[(W! - Wi)(Wj - Wj)\W] = 2XSij + E i:j . 

Then 



\Ef(W) - Ef(Z)\ < — y^-IE^ 



for any smooth f : R fc — > R, where Z £ R fc is a standard Gaussian random vector. 
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Note that the normalization for used here differs from that in the statement of Theorem 
4] by a factor of A. An earlier version of this paper was posted to arxiv.org which was based on an 
earlier version of Proposition 13.41 The version given above allows improved estimates in Theorem 

EU 

Convolution arguments similar to those in the proof of Corollary 13.31 yield bounds on Wasserstein 
and total variation distances. 

Corollary 3.5. Under the same hypotheses as Proposition \3.4\ 



di(W,Z) < T E, 
A 



// moreover W is log-concave, then 

I./:; 

k. 



d TV (W,Z) < C I -E A 



where C > is an absolute constant. 



k i„2 



\ i,3=l 



Proof. To prove the first claim, let g : M. k — > M. be 1-Lipschitz, and define h = g * (ft for t > 0. 
Standard calculations show 

(3.7) M^h) < M^W^U < 1 
and 

(3.8) M 2 (h) < Mi(g) sup || (V(pt,0) ||i < \ -\. 

ee5 n-l \ TTt 

Note that Eh(W) = Eg(W + tZ), where Z £ M fc is a standard Gaussian random vector independent 
of W , which implies 

(3.9) \Eh(W) -Eg(W)\ < E\tZ\ < tVk 
since g is 1-Lipschitz, and similarly 

(3.10) \Eh(Z) - Eg(Z) \ < tVk. 

The claim follows by applying Proposition 13.41 to h, using (|3.7p . (|3.8p . (|3.9p . and (|3.10p . and 
optimizing in t. 

The proof of the second claim is similar. As in the proof of Corollary 13.31 we may assume that 
W has a smooth density /. Let g G C c (M. k ) with ||<?||oo < 1 ; and again define h = g * tpt for t > 0. 
By standard calculations, 

(3.11) Mi(/i) < Halloo sup \\(v<pt,e)\\i< 21 

and 



7T t 



(3.12) M 2 (/i) < IbHoo sup < 



-.1/1 /-A II. ^ 

t 2 ' 



Proposition I3.1t [2|) and the identity 
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imply that 

(3.13) \Eh(W) - Eg(W)\ < \\g\U\f * ft - /||i < ckt. 
Similarly, 

(3.14) \Eh(Z) - Eg(Z)\ < ||p||oo||Vl * ft - Villi < ckt 

This last estimate can be improved to y/2kt using (|3.1I) . but there is no advantage to doing so here. 
Applying Proposition S3] to h and using (I3TTTD . (13321) . (I3TT3D . and ([37L41 yields 

(3.15) d TV (W,Z) = sup \Eg(W)-Eg(Z)\<- + ^-+ckt 



<?eC c (R fc ), ||g|U<l 

for any t > 0, where 



12A 1 1 ' 7r A 



k 

<2 



Although it is not straightforward to optimize the r.h.s. of f)3. 15|) precisely, this is simplified by 
noting that cItv(W, Z) < 2 always. Therefore (|3,15p is vacuously true for t > 2/(cfc), and so 

<l TV (W,Z)< A + 2 ?JW +ck t 
for any t > 0. Optimizing this latter expression in t yields 

d TV (W,Z) < C(k 2 A + kB) 1/3 , 
from which the result follows. □ 



4. Proofs of the main results 

In this section we prove Theorems 12.11 and 12.21 from Corollary 13. 5^ and indicate how Theorems 
12.31 may be proved from Corollary 13.31 The arguments mostly generalize the proofs of |18j . 

First, observe that the Prekopa-Leindler theorem [21j 16J implies that marginals of log-concave 
measures are log-concave. Therefore when X is log-concave, W is log-concave as well, and the 
second estimate of Corollary 13.51 may be applied. This fact will be used without further comment 
in all the proofs in this section. 

Second, we state a version of Klartag's concentration result for unconditional convex bodies. 

Proposition 4.1 (Klartag). If X is isotropic, unconditional, and log-concave, and a%, . . . , a n 6R, 

then 

(n \ n 

t=\ J i=\ 

Proposition 14 . 1 1 is essentially a special case of [141 Lemma 4], which is stated with the additional 
assumption that a%, ... ,a n > 0. For the precise constants which appear here see the comments 
following the proof in p3]; an extra factor of 2 is introduced to allow negative coefficients by 
observing that 

Var(A + Y) < 2(Var X + Var Y) 

for any pair of random variables X and Y . 

We now proceed with the proofs of our main results. 
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Proof of Theorem \2.1\ To construct W appropriately coupled with W, we first define X' by re- 
flecting X in a randomly chosen coordinate hyperplane, and then let W[ = {X',6i}. By the 
1-unconditionality of X, X and X' are identically distributed and hence so are W and W . 

More precisely, let I be a random variable chosen uniformly from {1, . . . , n} and independently 
from the random vector X. Then 

X' = X - 2Xjej, 
where is the ith standard basis vector in W 1 , and 

W[ = {X- 2Xie T , 0i) = Wi- 2Q\Xj. 

It follows that 

E[Wi - WAX] = -- V 0f^Q = --Wi 



and 



Therefore we may apply Corollary 13.51 with A = - and 



Now by Jensen's inequality, Proposition 14.11 and the Cauchy-Schwarz inequality, 
1. 



A 



-IE , 



A 



\ ij=i V=i 



< 



v 2 E E« 

^ i,j=i «=i 



2(^)2 



< 8^2^11^111- 



1=1 



By the triangle inequality for the L 3 / 2 norm and a precise version of Borell's lemma (found, e.g., 
in [201), 



(4.1) 



/ k \3/2 / k x 3/2 

E|PF - W| 3 = Ef \ W 'i ~ W i\ 2 ) ~ (Yl ( E \ W i ~ W i\ 3 ) ) 

^(i(Ei^iw) 2/ f ^(l> 

\i=i v=i 7 / v i=i 



" \ 3/2 

'ilk 



By the standard estimates between 01 norms and the fact that ||0i||§ < 



(4.2) 



v p „™ ™„ ||-4 1I3 

fc x 3/2 _ k _ fc 

i=l 7 i=l i=l 



Proposition 13.51 now implies the stated bound for dyy immediately. For the bound on d\ observe 



also that \\di\\\ < \6i\ 2 = 1, and so YTi=i \\9i\W < k. 



□ 



GAUSSIAN MARGINALS OF CONVEX BODIES WITH SYMMETRIES 



11 



By using Proposition 13,41 directly, the proof of Theorem 12,11 above yields better bounds on the 
distance 

d 2 (W,Z) =sup{|E/(W0-E/(Z)| :Mi(/),M 2 (/) < l}. 
In particular, as in the remarks following the statement of Theorem 12.11 under the conditions of 
that theorem, typical /c-dimensional marginals are nearly Gaussian with respect to di if k <C n 1 / 3 . 
The same remark applies to the proof of Theorem 12.21 below. While we have preferred here to work 
with the more classical Wasserstein and total variation metrics, metrics like cfe based on smooth 
test functions are commonly used in quantifying multivariate Gaussian approximation. 

It is also worth pointing out here that [H] does prove multivariate Gaussian approximation 
results, but with respect to a weak metric referred to as T-distance which captures only the behavior 
of 1-dimensional marginals. Using T-distance yields misleadingly good results in terms of how large 
k may be for an approximately Gaussian marginal, cf. the remarks at the very end of [12] . Metrics 
like d,TVi di, and c?2 based on regular test functions better capture high-dimensional behavior. 

Proof of Theorem \2.2l In this case X' is obtained by reflecting X in a hyperplane spanned by (n— 1) 
vertices of A n ; alternatively one may think of this operation as transposing two vertices. 

We will need the well-known facts about vertices of centered regular simplices (which may be 
seen e.g. as consequences of John's theorem, cf. [3]) that 

n+1 

(4.3) 5>i = 

i=i 

and 

n+1 

(4.4) Yj ( x > v *) y i 

1=1 

for any x £ W n . It will be convenient to use the notation 



n + 1 
x 

n 



u 



JZ^fr-vj), !<*,,"< n + 1, 



and x %0 = (x,Uij) for x £ M ra . It follows from (|4,3p and (|4.4p that \v>ij\ = 1 for i ^ j and 
(4.5) Y x im u em = (n + l)x VxelT. 

To define W precisely, first pick a pair (J, J) of distinct elements of {1, . . . ,n+ 1} uniformly and 
independently of X. Let 

X' = X - 2X IJ UIJ . 
and W[ = (X',6i) as before. Using (|4.5p . one obtains 

E[W[-Wi\X] =--{X,9i) = --Wi, 
1 n n 

4 n+1 



E[(w>-w l )(w>-w j )\x]=——- el m e^{x £m )\ 

v ; e,m=i 

and so Corollary 13.51 applies with A = ^ and 

— Y 9! m 0j m (X £m ) 2 -Sij. 



12 M. MECKES 

The relevant moments were calculated in |18| : for I ^ m and p 7^ q, 



(4.6) 

and 
(4.7) 



r£m\2/ypq\2 _ ( n + l)(n + 2) 



E(x tm ) (x pq y 



(n + 3)(n + 4) 



1 if {£,m}n{p,q} = 0, 
3 if |{f,m}n{p,<;}| =1, 
6 if \{£,m}n{ P ,q}\ =2, 



E|A^ m | 3 < 3\/2. 



In order to estimate E£^-, decompose the resulting sum of terms involving K(X ) (X pq ) ac- 
cording to the size of {£, m} n {p, g} and use (|4.6() : 



(4.8) 



(n + 3)(ra + 4) 
(n + 



l)[n + 2) 



',m,p,q 



j2^ m 0j m ) + 2 E ^r^f+io^(^ m ) 2 (^ 

|{£,m}n{p,g}|=l 



fon\2 



In all of the sums in (|4.8D the indices range from 1 to n + 1, with £ ^ m and p ^ q and in the last 
term we have also used that ug m = —u m £. It follows from f)4.5f) that 



(4.9) 



£ ^jm = ( n + 1) ty) = ( n + 1)^, 



?f TO ) 2 (^ m ) 2 



< /£(flf»)4 /^>f)4<4^ 



By the Cauchy-Schwarz inequality, the definition of U£ m , and the ^4 triangle inequality, 
(4.10) 



n+l 



\i 1=1 



n+l 



\ E<^> 4 - 

\ £=1 



Also by the Cauchy-Schwarz inequality, 
(4.11) £ 0f m 0f0j m 0f< £ (0. 



lm\2iQlm\2 



< 



4nJ>| 



lm\2 1 rAm\1 

3 > - "'^rt ) C e i 

|{£,m}n{p,<?}|=l |{f,m}n{p,g}|=l 
Ij^m, p^q Ij^m, p^q 



Combining (@3J), (|4"U]) . (JUDJ), and (ICTj) . 



\ £,m,p,q 



'i.l 



< 



512 



71- 



n+1 



n+l 



and therefore the first error term in Corollary 13.51 is bounded by 

1 . 



A 



E 



£ 4 £ ^£ 



i=l A €=1 



n+l 



E<^> 
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To bound the second error term, we begin as in (|4.ip . using (|4.7p . the definition of U£ m , and the 
£3 triangle inequality to obtain 

3/2 



E\W - W\ 3 < { E \ W i ~ w i\ 3 ) 2/ ) 

^ i=i ' 



3/2 



n(n + 1) 



< 



< 



(n + 



(n + 



1 -«M m > 1 3 ) j 



it E E iei 3 Ei^ m i 3 

i=l ^£^ m 
3/2 



2/3 



The error bounds from Proposition 13.51 are now simplified similarly as in the proof of Theorem [2TJ 
For each i define Xi = (xj, . . . , by x\ = (Oi, ve). By (|4.4p . 



n+l 



2 n + l. 



n + l 



n 



n 



Therefore \\xi\\ 3 < \xi\\\xi\\^ < y^J^HaJilllj and so by the same reasoning as in (14. 2p . 



96^ 



i=i \j £=1 



n+l 



finally observe also that \\xi\\1 < |x| 2 to simplify the bound on d\. 



□ 



Theorem 12.31 may be proved by following the proof of Theorem 12.21 in the case k = 1, applying 
Corollary 13.31 in place of Corollary 13.51 Alternatively, one can follow the proof of |X8|, Corollary 
6], using Corollary 13.31 in place of the Stein's Kolmogorov distance version of Proposition 13.21 this 
amounts to the same thing. 

In [T5] Stein's Kolmogorov distance version of Proposition 13.21 was applied for arbitrary isotropic 
X (under various symmetry assumptions). Total variation estimates for the log-concave case were 
then deduced using [5J Theorem 3.3], which allows Gaussian approximation estimates for log- 
concave random variables to be transferred from Kolmogorov distance to total variation distance. 
The present approach entails less loss in the final total variation bound since it uses only one 
smoothing argument instead of two. In general, the second approach described above to prove 
Theorem 12.31 can be used to deduce total variation bounds of the same order as the Kolmogorov 
distance bounds in most of the results of [18] for log-concave random vectors. In particular, this 
applies to Theorem 1, Corollary 4(2), and parts of Corollary 5 of [18]. 

In a similar fashion, using Proposition 13.21 directly yields versions of many of the results of [18] 
for the bounded Lipschitz metric 

d BL (X,Y) = su V {\Ef(X)-Ef(Y)\ : ||/||oo 5 |/U<l}- 

In general, the dominant error term for the results of [18] using dsL is typically of the order n -1 / 2 , 
as opposed to the order ra -1 / 4 for the Kolmogorov distance in most of the results of [18j and for 
dxv in the present Theorem 12.31 

In [12], Klartag used another approach, based on an application of the classical Berry-Esseen 
theorem, to prove a univariate estimate in the setting of Theorem 12.11 Since Stein's method can 
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be used to prove the Berry-Esseen theorem, the approach taken here and in |18| is arguably more 
direct, and the total variation bounds which can be derived in this way are better than those 
derived by the method of [12] . However, since the original version of this paper was written, in p3] 
Klartag has given a proof of an optimal result for Kolmogorov distance, which, as discussed after 
the statement of Theorem [2J] above, implies sharper total variation bounds (when k = 1) than the 
methods used here. Klartag's proof is based partly on the optimal concentration result proved in 
[14] . and also on careful arguments similar to those in classical proofs of the Berry-Esseen theorem. 
It is not clear whether the Stein's method approach can achieve these optimal error bounds. 

Acknowledgements. The author thanks S. Chatterjee and E. Meckes for showing him an early 
version of [7] and E. Meckes for useful discussions. 
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